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Abstract 

We propose a new algebraic framework to discuss and classify recognizable tree 
languages, and to characterize interesting classes of such languages. Our algebraic 
tool, called preclones, encompasses the classical notion of syntactic S-algebra or 
minimal tree automaton, but adds new expressivity to it. The main result in this 
paper is a variety theorem a la Eilenberg, but we also discuss important examples 
of logically defined classes of recognizable tree languages, whose characterization 
and decidability was established in recent papers (by Benedikt and Segoufin, and 
by Bojaiiczyk and Walukiewicz) and can be naturally formulated in terms of pseu- 
dovarieties of preclones. Finally, this paper constitutes the foundation for another 
paper by the same authors, where first-order definable tree languages receive an 
algebraic characterization. 



1 Introduction 



The notion of recognizability emerged in the 1960s (Eilenberg, Mezei, Wright, 
and others, cf. [17,30]) and has been the subject of considerable attention since. 
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notably because of its close connections with automata-theoretic formalisms 
and with logical definability, cf. [6,15,18,38] for some early papers. 

Recognizabihty was first considered for sets (languages) of finite words, cf. [16] 
and the references contained in op. cit. The general idea is to use the algebraic 
structure of the domain, say, the monoid structure on the set of all finite words, 
to describe some of its subsets, and to use algebraic considerations to discuss 
the combinatorial or logical properties of these subsets. More precisely, a set 
of words is said to be recognizable if it is a union of classes in a (locally) finite 
congruence. The same concept was adapted to the case of finite trees, traces, 
finite graphs, etc, cf. [17,30,14,9], where it always entertains close connections 
with logical definability [11,12]. 

It follows rather directly from this definition of (algebraic) recognizabihty 
that a finite - or finitary - algebraic structure can be canonically associated 
with each recognizable subset L, called its syntactic structure. Moreover, the 
algebraic properties of the syntactic structure of L reflect its combinatorial 
and logical properties. The archetypal example is that of star-free languages 
of finite words: they are exactly the languages whose syntactic monoid is 
aperiodic, cf. [34]. They are also exactly the languages that can be defined by a 
first-order sentence of the predicate < (F0[<]), cf. [29], and the languages that 
can be defined by a temporal logic formula, cf. [27,22,7]. In particular, every 
algorithm we know for deciding the FO[<]-deflnability of a regular language 
L, works by checking, more or less explicitly, whether the syntactic monoid of 
L is aperiodic. 

Let E be a ranked alphabet. In this paper, we are interested in sets of fi- 
nite E-labeled trees, or tree languages. It has been known since the 1960s 
[17,30,15] that the tree languages that are definable in monadic second order 
logic are exactly the so-called regular tree languages, that is, those accepted 
by bottom-up tree automata. Moreover, deterministic tree automata suffice 
to accept these languages, and each regular tree language admits a unique, 
minimal deterministic automaton. From the algebraic point of view, the set 
of all S-labeled trees can be viewed in a natural way as a (free) S-algebra, 
where E is now seen as a signature. Moreover, a deterministic bottom-up tree 
automaton can be identified with a finite E-algebra, with some distinguished 
(final) elements. Thus regular tree languages are also the recognizable subsets 
of the free E-algebra. 

The situation however is not entirely satisfying, because we know very little 
about the structure of finite E-algebras, and very few classes of tree lan- 
guages have been characterized in algebraic terms, see [26,32,33] for attempts 
to use E-algebra-theoretic considerations (and some variants) for the purpose 
of classifying tree languages. In particular, the important problem of deciding 
whether a regular tree language is F0[<] -definable remained open [33]. Based 
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on the word language case, it is tempting to guess that an answer to this 
problem ought to be found using algebraic methods. 

In this paper, we introduce a new algebraic framework to handle tree lan- 
guages. More precisely, we consider algebras called preclones (they lack some 
of the operations and axioms of clones [13]). Precise definitions are given in 
Section 2.1. Let us simply say here that, in contrast with the more classical 
monoids or E-algebras, preclones have infinitely many sorts, one for each inte- 
ger n > 0. As a result, there is no nontrivial finite preclone. The corresponding 
notion is that of finitary preclones, that have a finite number of elements of 
each sort. An important class of preclones is given by the transformations 
T{Q) of a set Q. The elements of sort (or rank) n are the mappings from 
into Q and the (preclone) composition operation is the usual composition of 
mappings. Note that T{Q) is finitary if Q is finite. 

It turns out that the finite S-labclcd trees can be identified with the 0-sort 
of the free preclone generated by S. The naturally defined syntactic preclone 
of a tree language L is finitary if and only if L is regular. In fact, if S is the 
syntactic E-algebra of L, the syntactic preclone is the sub-preclone of T{S) 
generated by the elements of E (if cr e E is an operation of rank r, it defines a 
mapping from S'^ into S, and hence an element of sort r in T(S)). Note that 
this provides an effectively constructible description of the syntactic preclone 
of L. 

It is important to note that the class of recognizable tree languages in the 
preclone-theoretic sense, is exactly the same as the usual one - we are simply 
adding more algebraic structure to the finitary minimal object associated with 
a regular tree language, and thus, we give ourselves a more expressive language 
to capture families of tree languages. 

In order to justify the introduction of such an algebraic framework, we must 
show not only that it offers a well-structured framework, that accounts for the 
basic notions concerning tree languages, but also that it allows the character- 
ization of interesting classes of tree languages. The first objective is captured 
in the definition of varieties of tree languages, and their connection with pseu- 
dovarieties of finitary preclones, by means of an Eilenberg-type theorem. This 
is not unexpected, but it requires combinatorially much more complex proofs 
than in the classical word case, the details of which can be found below in 
Section 5.1. 

As for the second objective, we offer several elements. First the readers will find 
in this paper a few simple but hopefully illuminating examples, which illus- 
trate similarities and differences with the classical examples from the theory of 
word languages. Second, we discuss a couple of important recent results on the 
characterization of certain classes of tree languages: one concerns the tree lan- 
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guagcs that arc definable in the first-order logic of successors (-FO(Succ)), and 
is due to Benedikt and Segoufin [3] ; the second one concerns the tree languages 
defined in the logics EF and EX, and is due to Bojanczyk and Walukiewicz 
[5]. Neither of these remarkable results can be expressed directly in terms 
of syntactic E-algebras; neither mentions preclones (of course) but both use 
mappings of arity greater than 1 on E-algebras, that is, they can be naturally 
expressed in terms of preclones, as we explain in Sections 5.2.2 and 5.2.3. It is 
also very interesting to note that the conditions that characterize these various 
classes of tree languages include the semigroup-theoretic characterization of 
their word language analogues, but cannot be reduced to them. 

Another such result, and that was our original motivation to introduce the 
formalism of preclones, is a nice algebraic characterization of FO[<]-definable 
tree languages (and a number of extensions of F0[<], such as the introduction 
of additional, modular quantifiers), briefly discussed in Section 5.2.4. Let us 
say immediately that we do not know yet whether this characterization can 
be turned into a decision algorithm! In order to keep this paper within a 
reasonable number of pages, this characterization will be the subject of another 
paper by the same authors [21]. The main results of this upcoming paper can 
be found, along with an outline of the present paper, in [20]. 

To summarize the plan of the paper. Section 2 introduces the algebraic frame- 
work of preclones, discussing in particular the all-important cases of free pre- 
clones, in which tree languages live (Section 2.2), and of preclones associated 
with tree automata (Section 2.3). Section 2.4 discusses in some details the 
notion of finite determination for a preclone, a finiteness condition different 
from being finitary, which is crucial in the sequel. Section 2.5 is included for 
completeness (and can be skipped at first reading): its aim is to make explicit 
the connection between our preclones and other known algebraic structures, 
namely magmoids and strict monoidal categories. 

Recognizable tree languages are the subject of Section 3. Here tree languages 
arc meant to be any subset of some EAf^, and the preclone structure on SM 
naturally induces a notion of recognizability, as well as a notion of syntactic 
preclone (Section 3.1). As pointed out earlier, the usual recognizable tree lan- 
guages, that is, subsets of SMq, fall nicely in this framework, and there is a 
tight connection between the minimal automaton of such a language and its 
syntactic preclone (Section 3.2). Specific examples are given in Section 3.3. 

Pseudovarieties of finitary preclones are discussed in detail in Section 4. As it 
turns out, this notion is not a direct translate of the classical notion for semi- 
groups or monoids, due to the infinite number of sorts. The technical treat- 
ment of these classes is rather complex, and we deal with it thoroughly, since 
it is the foundation of our construction. We show in particular that pseudova- 
rieties are characterized by their finitely determined elements (Section 4.1), 
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and we describe the pseudovarieties generated by a given set of finitary pre- 
clones, showing in particular that membership in a 1-generated pseudovariety 
is decidable (Section 4.2). 

Finally, we introduce varieties of tree languages and we establish the variety 
theorem in Section 5.1. Section 5.2 presents the examples described above, 
based on the results by Benedikt and Segoufin [3] and by Bojariczyk and 
Walukiewicz [5]. 



2 The algebraic framework 

In this section, we introduce the notion of preclones, a multi-sorted kind of 
algebra which is our central tool in this paper. In the sequel, if n is an integer, 
[n] denotes the set of integers {1, . . . , n}. In particular, [0] denotes the empty 
set. 



2. 1 Preclones and preclone-generators pairs 

Let Q be a set and let Tn{Q) denote the set of n-arj transformations of 
Q, that is, mappings from Q" to Q. Let then T{Q) be the sequence of sets 
of transformations T{Q) = {Tn{Q))n>o, which will be called the preclone of 
transformations of Q. The set 71 (Q) of transformations of Q is a monoid 
under the composition of functions. Composition can be considered on T(Q) 
in general: if / G Tn{Q) and gi G T„i.{Q) {I < i < n), then the composite 
h = f{gi,... ,gn), defined in the natural way, is an element of T„((5) where 
m = Eie[n] m- 

■ ■ ■ ' Qn,m„) = f{gi{Ql,l, ■ ■ ■ , Ql,mi), ■ ■ ■ , 5'n(?n,l) ■ ■ ■ ) ?n,m„)) 

for all gfjj e Q, 1 < i < n, 1 < j < rui. This composition operation and 
its associativity properties are exactly what is captured in the notion of a 
preclone. 

In general, a preclone is a many-sorted algebra S — ((S'„)„>o, •, 1). The ele- 
ments of the sets Sm where n ranges over the nonnegative integers, are said to 
have rank n. The composition operation • associates with each / G Sn and gi G 
Smi,...,gn G S'm„, an element •(/, gi,...,gn) G Sm, of rank m = J2ie[n] ^i- 
We usually write f ■ {gi® ■ ■ ■ (B g-n) for •{f,gi,..., gn). Finally, the constant 1 
is in Si. Moreover, we require the following three equational axioms: 
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(/ • (^1 e • • • e ^n)) • (^1 e • • • e /im) =/ • ((^i • ^i) e • • • e (^n • K)), (i) 

where f,gi,..., Qn are as above, hj G 5*^^. (j G and if we denote Z]jg[j] 
by m[i], then hi = hm.^_^+i © ■ ■ ■ © hm,^, for each i e [n]; 



!■/ = / (2) 
/•(1®---©1) = /, (3) 

where f & Sn and 1 appears n times on the left hand side of the last equation. 

Note that Axiom (1) generalizes associativity, and Axioms (2) and (3) can be 
said to state that 1 is a neutral element. 

Remcirk 2.1 The elements of rank 1 of a preclone form a monoid. □ 



It is immediately verified that T{Q), the preclone of transformation of a set Q, 
is indeed a preclone for the natural composition of functions, with the identity 
function idg as 1. Preclones are an abstraction of sets of n-ary transformations 
of a set, which generalizes the abstraction from transformation monoids to 
monoids. 

Remark 2.2 Clones [13], or equivalently, Lawvere theories [4,19] are an- 
other more classical abstraction. Readers interested in the comparison be- 
tween clones and preclones will have no difficulty tracing their differences in 
the sequel. We will simply point out here the fact that, in contrast with the 
definition of the clone of transformations of Q, each of the m arguments of 
the composite f{gi, . . . above is used in exactly one of the gfj's, the first 
mi in gi, the next m2 in g2, etc. □ 



We observe that a preclone with at least one element of rank greater than 1 
must have elements of arbitrarily high rank, and hence cannot be finite. We 
say that a preclone S is finitary if and only if each Sn is finite. For instance, 
the preclone of transformations T[Q) is finitary if and only if the set Q is 
finite. 

The notions of morphism between preclones, sub-preclone, congruence and 
quotient are defined as usual [25,39]. Note that, as is customary for multi- 
sorted algebras, a morphism maps elements of rank n to elements of the same 
rank, and a congruence only relates elements of the same rank. 

To facilitate discussions, we introduce the following short-hand notation. An 
n-tuple ((?!,..., gn) of elements of S will often be written as a formal ©-sum: 
gi (B ■ ■ ■ (B gn- Moreover, if g^ G 5^. (1 < i < n), we say that gi (B ■ ■ ■ (B gn has 
total rank m — mj. Finally, we denote by Sn,m the set of all n-tuples of 
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total rank m. With this notation, = Sn- The n-tuple 1 © ■ ■ ■ © 1 G Sn,n is 
denoted by n. If G is a subset of S, we also denote by Gn.m the set of n-tuples 
of elements of G, of total rank m. 

Observe that a preclone morphism (p: S ^ T, naturally extends to a map 
V- Sn,m Tn,m for each n,m>0, by mapping gi®- ■ -©gr^ to (p{gi)®- ■ -^^ign)- 

For technical reasons, it will often be preferable to work with pairs [S, A) 
consisting of a preclone S and a (possibly empty) set A of generators of S. We 
call such pairs preclone- generators pairs, or pgf-pairs. The notions of morphisms 
and congruences must be revised accordingly: in particular, a morphism of pg- 
pairs from (5*, A) to (T, B) must map A into B. A pg(-pair (5*, A) is said to be 
finitary if S is finitary and A is finite. 

Besides preclones of transformations of the form T„(Q), fundamental examples 
of preclones and pg-^aim are the free preclones and the preclones associated 
with a tree automaton. These are discussed in the next sections. 



2.2 Trees and free preclones 

Let E be a ranked alphabet, say, S = (E„)„>o, and let {vk)k>i be a sequence 
of variable names. We let EM„ be the set of finite trees whose inner nodes are 
labeled by elements of S (according to their rank), whose leaves are labeled 
by elements of Sq U {f i, . . . , t'„}, and whose frontier (the left to right sequence 
of variables appearing in the tree) is the word Vi ■ ■ - Vn- that is, each variable 
occurs exactly once, and in the natural order. Note that EMq is the set of 
finite E-labeled trees. We let EM = (EM„)„. 




gi 92 9n 



If / e EM„ and gi, . . . ,gn £ EM, the composite tree f ■ {gi Q) ■ ■ ■ (B gn) is 
obtained by substituting the root of the tree gi for the variable Vi in / for each 
i, and renumbering consecutively the variables in the frontiers of gi, . . . ,gn- 
Let also 1 e EMi be the tree with a single vertex, labeled Vi. Then (EM, •, 1) 
is a preclone. 

Each letter cr e E of rank n can be identified with the tree with root labeled 
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0", where the root's children are leaves labeled vi, . . . It is easily verified 
that every rank-preserving map from S to a preclone S can be extended in a 
unique fashion to a preclone morphism from EM into S. That is: 

Proposition 2.3 EM is the free preclone generated by E, and (EM, E) is the 
free pg-pair generated by E. 

Remark 2.4 If E„ = for each n ^ 1, then EM„ = for all n 7^ 1, and EMi 
can be assimilated with the set of all finite words on the alphabet Ei. 

If at least one E„ with n > 1 is nonempty, then infinitely many SM„ are 
nonempty, and if in addition Eq 7^ 0, then each EA/„ is nonempty. □ 



2.3 Examples of preclones 

We already discussed preclones of transformations and free preclones. The 
next important class of examples is that of preclones (and pg-pa.irs) associated 
with E-algebras and tree automata. We also discuss a few simple examples of 
preclones that will be useful in the sequel. 

2.3.1 Preclone associated with an automaton 

Let E be a ranked alphabet as above and let Q be a H-algehra: that is, Q 
is a set and each element cr G E„ defines an n-ary transformation of Q, i.e., 
a mapping a'^:Q^ — > Q. Recall that Q, equipped with a set F C Q of final 
states, can also be described as a (deterministic, bottom-up) tree automaton 
accepting trees in EMq, cf. [15,38,23,24,8]. 

More precisely, the mapping a ^ a'^ induces a morphism of E-algebras from 
EMo, viewed here as the initial E-algebra {i.e., the algebra of E-terms), to Q, 
say, val: EMq Q, and the tree language accepted by Q is the set var^(F) 
of trees which evaluate to an element of F. 

Now, since the elements of E„ can be viewed also as elements of Tn{Q), the 
mapping cr 1— > u'^ also extends to a preclone morphism r: EM — > T{Q), whose 
restriction to the rank elements is exactly the morphism val. The range of 
r is called the preclone associated with Q, and the pg-pair associated with Q, 
written pg(Q), is the pair (r(EM), t(E)). 

We observe in particular that a morphism of E-algebras ip:Q ^ Q' induces a 
morphism of pg^-pairs </?: pg((5) — > pg((5') in a functorial way. 

Conversely, if Q is a set and r: EM — > T{Q) is a preclone morphism such 
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that r(SMo) = Q, letting = r(o-) endows the set Q with a structure of 
E-algebra, for which the associated preclone is the range of r. 

In the sequel, when discussing decidability issues concerning preclones, we will 
say that a preclone is effectively given if it is given as the preclone associated 
with a finite S-algebra Q, that is, by a finite set of generators in T(Q). By 
definition, such a preclone is finitary. 



2.3.2 Simple examples of preclones 

The following examples of preclones and pgi-pairs will be discussed throughout 
the rest of this paper. 

Example 2.5 Let B be the 2-element set B = {true, false}, and let T3 be the 
subset of T(B) whose rank n elements are the n-ary or function and the n-ary 
constant true, written respectively or„ and true„. One verifies easily that T3 
is a preclone, which is generated by the binary or2 function and the nuUary 
constants trueo and falseo- That is, if E consists of these three generators, Tg 
is the preclone associated with the E-automaton whose state set is B. 

Moreover, the rank 1 elements of T3 form a 2-element monoid, isomorphic to 
the multiplicative monoid {0, 1}, and known as Ui in the literature on monoid 
theory, e.g. [31]. □ 

Example 2.6 Let p be an integer, p > 2 and let Bp = {0, 1, ... ,p — 1}. We 

let Tp be the subset of T{Mp) whose rank n elements {n > 0) are the mappings 
fn,r'- {f^i, ■ ■ ■ , f^n) ^ f^i + ■ ■ ■ + Tn + T mod p for < r < j9. It is not difficult to 
verify that Tp is a preclone, and that it is generated by the nuUary constant 
0, the unary increment function /i^i and the binary sum /2,o- 

As in Example 2.5, Tp can be seen as the preclone associated with a p-state 
automaton. Moreover, its rank 1 elements form a monoid isomorphic to the 
cyclic group of order p. □ 

Example 2.7 Let again B = {true, false}, and let Tpath be the subset of T(B) 
whose rank elements are the nuUary constants trueo and falseo, and the 
rank n elements (n > 0) are the n-ary constants true^ and false^, and the 
n-ary partial disjunctions orp (if P C [n], orp is the disjunction of the i- 
th arguments, i E P). One verifies easily that Tpath is a preclone, which is 
generated by the binary or2 function, the nuUary constants trueo and falseo, 
and the unary constant falsei. The rank 1 elements of Tpath form a 3-element 
monoid, isomorphic to the multiplicative monoid {l,a,b} with xy = y for 
x,y ^ 1, known as U2 in the literature on monoid theory, e.g. [31]. □ 
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2-4 Representation of preclones 



Section 2.3.1 shows the importance of the representation of preclones as pre- 
clones of transformations. It is not difficult to establish the following analogue 
of Cayley's theorem. 

Proposition 2.8 Every preclone can be embedded in a preclone of transfor- 
mations. 

Proof. Suppose that 5* is a preclone and let Q be the disjoint union of the 
sets Sn, n>0. For each / e Sn, let / be the function ^ Q given by 

7{9l,---,9n)=f ■ i9l®---®9n)- 

The assignment f ^ f defines an injective morphism S — > T(Q). □ 

This result however is not very satisfactory: it docs not tell us whether a 
finitary preclone can be embedded in the preclone of transformations of a 
finite set. It is actually not always the case, and this leads to the following 
discussion. 

Let A; > 0. Wc say that a preclone S is k-determined if distinct elements 
can be separated by /c-ary equations. Formally, let ~fe denote the following 
equivalence relation: for all f,geSn (n > 0), 

f 9 f ■ h = g ■ h, for all e Sn/ with £ < k. 

Note that for each i < k, is the identity relation on T^. Wc call S k- 
determined if the relation is the identity relation on each Sn, n > 0, and 
we say that S is finitely determined if it is A;-determined for some integer k. 
We also say that a pg'-pair {S, A) is /c-determined (resp. finitely determined) 
if S is. 

Example 2.9 The preclone of transformations of a set is 0-determined. □ 

We observe the two following easy lemmas. 

Lemma 2.10 For each k, is a congruence relation. 

Proof. Let f,g e Sn be ~jt-equivalent. For each i G [n], let fi,gi E Smi, such 
that fi gi. We want to show that / • (/i ® • • • /n) ~ifc • (fl'i ® • • • ® gn)- 

Let m = J2ie[n] l^t h e Sm,i for some £ < k. Then h is an m-tuple, and 

we let hi be the tuple of the first mi terms of h, /i2 consist of the next m2 
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elements, etc, until hn, which consists of the last elements of h. Note that 
each hi lies in some Smi/i and that J2ie[n] — ^- particular, £i < k for each 
i and we have 



/ • (/i © • • • e /„) • = / • (/i • /ii ® • • • e /„ • 

-(gi-hi^-'-^gn-hn) 

^g-{gi-hi®---®gn-K) 
^g-{gi®---®gn)-h. 

□ 

Lemma 2.11 For each k > 0, the quotient preclone S/'-^k is k-determined. 

Proof. Let T = S/^^k and let [/], [sf] G where [/] denotes the 
equivalence class of / (necessarily in S'n). Let i < k and assume that [/] ■ [h] ~ 
[g] • [h] for each h e T^/. Then f -h g-hior each h. But / • h and g-hlie in 

Se, and we already noted that is the identity relation on (since £ < k). 
Thus f ■ h = g ■ h ioT all h E Sn,e, and since this holds for each i < k, we have 
/ 5(, and hence [/] = [g]. □ 

We say that a preclone morphism ip: S —>■ T is k-injective if it is injective 
on each with i < k. The next lemma, relating /^-determination and k- 
injectivity, will be used to discuss embeddability of a finitary preclone in the 
preclone of transformations of a finite set. 

Lemma 2.12 Let S be a k-determined preclone. If (p: S ^ T is a k-injective 
morphism, then ip is injective. 

Proof. If (/?(/) = ip{g) for some f,g E Sn, then (/?(/ • h) = (/?(/) • ip(h) = 
h{g) ■ (p{h) = ip{g ■ h), for all h G Sn/ with i < k. Since (p is /c-injective, it 
follows f ■ h = g ■ h foT all h E Sn/ with i < k, and since S is /c-determined, 
this implies f = g. □ 

Proposition 2.13 Let S be a finitary and finitely determined preclone. Then 
there is a finite set Q such that S embeds in T{Q). If in addition S is 0- 
determined, the set Q can he taken equal to Sq. 

Proof. We modify the construction in the proof of Proposition 2.8. Let A; > 
be such that S is ^-determined, and let Q = {±} U Uj<fc Si, where the sets 
Si are assumed to be pairwise disjoint and ± is a new symbol, not in any of 
those sets. For each / e (n > 0), let (p{f) = f : ^ Q he the function 
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defined by 

J(qi, . . . , g„) = I ■ "^^i ® ■ ■ ■ ® a qi e Srrn: Qn ^ S„,^ and J2m < k, 
' " I ± otherwise. 

It is easy to check that 93 is a morphism. By Lemma 2.12, ip is injective. 

To conclude, we observe that if = 0, we can choose Q — Sq since rrii < 
is possible if and only if each rrii — 0. □ 

For later use we also note the following technical results. 

Proposition 2.14 Let S and T be preclones, with T k-determined. Let G he 
a (ranked) generating set of S and let (p:G ^ T be a rank-preserving map, 
whose range includes all ofT^, for each £ < k. Then ip can he extended to a 
preclone morphism Tp: S ^ T iff for all g e Gn, n > and for all h e Gn,i 
with i < k, 



ip{g-h)^ip{g)-ip{h). (4) 

Proof. Condition (4) is obviously necessary, and we show that it is sufficient. 

Let f & Sn, n > 0. Any possible image of / by a preclone morphism is an 
element g G T„ such that, if G Tn/ for some i < k and if h' G Gn,e is such 
that (p{h') = h, then g ■ h = ip{f ■ h'). Since T is fc-determined and each 
{I < k) is in the range of </?, the element g is completely determined by /. 
That is, if an extension of </? exists, then it is unique. 

We now show the existence of this extension. We want to assign an image to 
an arbitrary element / of and we proceed by induction on the height of an 
expression of / in terms of the elements of G; such an expression exists since 
G generates 5. If / G G, we let ^(/) = ¥'(/). Note also that if / = 1, then 
we let ^(/) = 1. If / ^ G, then f = g ■ h for some g E G (1 Sn and some 
h = hi Q) ■■■ (B hn- By induction, the elements ^(/ij) are well defined for each 
i e[n]. We then let ^(/) = ip{g) ■ {Tp{hi) © • • • © ^{hn)). 

To show that ^(/) is well-defined, we consider a different decomposition of /, 
say, f = g' ■ h' with h' = h[(B ■ ■ ■ (B h'^. If / has rank £ < k, then h G Snj and 
h' G Sm,e, so h = (p(h) and h' = (p(h') for some h G Gn/ and h' G Gm^. By 
Condition (4), we have 

^{g) ■ ^{h) = (p{g) ■ ip(h) = ip{g ■ h) = 

and by symmetry, p){g) ■ Tp(h) = (p{g') ■ ^{h'). So ip is well defined on all the 
elements of S of rank at most k. 
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Now if / has rank £ > k, let x & Ti^p with p < k. Then there exists x' G Gi^p 
such that X = ^{x'). Note that h ■ x' and h! ■ x' are well defined, in Sn^p and in 
Sm,p respectively. In particular, Ip{h • x') is well defined, and equal to Tp{h) ■ x. 
Similarly, ^(/i' ■ x') is well defined, equal to Tp{h') ■ x. It follows that 

{^{9) ■ ■ X = ip{g) ■ {Tp{h) ■ x) = ip{g) ■ Jp{h ■ x') = Jp{g ■ {h ■ x')) = Jp{f ■ x'). 

By symmetry, {(p{g') - Tpiji')) -x = {(p{g) ■'^{h)) -x, and since T is fc-determined, 
we have ip{g) • Tp{h) — (p{g') • '<p{h'). Thus, Tp is well defined on S. 

By essentially the same argument, one verifies that ^ preserves composi- 
tion. □ 

Corollary 2.15 Let S andT he k - determined preclones that are generated by 
their elements of rank at most k. If there exist bijections from to 7) for 
each i < k, that preserve all compositions of the form f ■ g, where f e Sn, 
g e Sn/ with n,£ < k, then S and T are isomorphic. 

Proof. Using the fact that S is generated by its elements of rank at most 
k and T is /c-determined. Proposition 2.14 shows that the given bijections 
extend to a morphism from S to T. This morphism is onto since T as well is 
generated by its rank k elements. It is also fc-injective by construction, and 
hence it is injective by Lemma 2.12 since S is fc-determined. □ 



2.5 Preclones, magmoids and strict monoidal categories 



The point of this short subsection is to verify the close connection between 
the category of preclones and the category of magmoids, cf. [2], which are 
in turn a special case of strict monoidal categories, cf. [28]. We recall that a 
magmoid is a category M whose objects are the nonnegative integers equipped 
with an associative bifunctor © such that 0©a; = a; = a;©0. A morphism of 
magmoids is a functor that preserves objects and ©. We say that a magmoid 
M is determined by its scalar morphisms if each morphism f:n^m can be 
written in a unique way as a ®-sum /i ® • • • ® /n, where each /j is a morphism 
with source 1. Moreover, there is a morphism to n if and only if n = (in 
which case there is a unique morphism) . 

Proposition 2.16 The category of preclones is equivalent to the full subcate- 
gory of magmoids spanned by those magmoids which are determined by their 
scalar morphisms. 

Proof. With each preclone S, we associate a category whose objects are the 
nonnegative integers and whose morphisms n ^ m are the elements of Sn,m-i 
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that is, the n-tuples of elements of S of total rank m. Composition is defined 
in the following way: let / = /i ® ■■■©/„ G S^^m and g = Qi® - ■ -^Qm ^ Sm,p, 
and suppose that /j G ^m., i G [n] (so that m = J2ie[n] ''^i) ■ ^^^h i, let 
9i = 9m,+-+mi-,+i © ■ • ■ © gmi+-+mi- Then we let 



f-9^fl-9l®---®fm-gm- 

The identity morphism at object n is the n-tuple n = 1 © • • • ® 1. Note that 
when n = 0, this is the unique morphism 0^0, and there are no morphisms 
from to n if n 7^ 0. 

One may then regard © as a bifunctor S x S ^ S that maps a pair (/, g) 
with / = /i © • • • © /n e Sn^p and g — gi ® • ■ ■ ® gm ^ '5m,q to the morphism 
/i © ■ ■ ■ © /n © 5"! © • • • © fi'm from n + m to p + q. Then S, equipped with the 
bifunctor ©, is a magmoid. Moreover, 5", as a magmoid, is determined by its 
scalar morphisms. 

It is clear that each preclone morphism determines a functor between the cor- 
responding magmoids which is the identity function on objects and preserves 
© and is thus a morphism of magmoids. 

Conversely, if M is a magmoid determined by its scalar morphism, then its 
morphisms with sourse 1 constitute a preclone S, moreover, M is isomorphic 
to the magmoid determined by 5". □ 



3 Recognizable tree languages 

As discussed in the introduction, the theory of (regular) tree languages is well 
developped [23,24,8] (see also Section 2.3.1 above). Here we slightly extend 
the notion of tree languages, to mean any subset of some SM^, /c > 0. In the 
classical setting, tree languages are subsets of EMq. 

The preclone structure on EM, described in Section 2, leads in a standard 
fashion to a definition of recognizable tree languages [17,30,9,12,40]. This is 

discussed in some detail in Section 3.1. As we will see in Section 3.2, recog- 
nizability extends the classical notion of regularity for tree languages, and it 
gives us richer algebraic tools to discuss these languages. Further examples 
are given in Section 3.3. 
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3.1 Syntactic preclones 



Suppose that a : EM — > 5" is a preclone morphism, or a morphism (EM, E) — > 
{S, A). We say that a subset L of EM^ is recognized by a ii L — a~^a{L), or 
equivalently, if L = a~^{F) for some F C Sk- Moreover, we say that L is 
recognized by S, or by {S, A), if L is recognized by some morphism EM S 
or (EM, E) {S,A). FinaUy, we say that a subset L of EM^ is recognizable 
if it is recognized by a finitary preclone, or pgf-pair. As usual, the notion of 
recognizability can be expressed equivalently by stating that L is saturated 
by some locally finite congruence on EM, that is, L is a union of classes of a 
congruence which has finite index on each sort [10,11,40]. 

With every subset L C EM^, recognizable or not, we associate a congruence on 
EM, called the syntactic congruence of L. This relation is defined as follows. 
First, an n-ary context in TiMk is a tuple (m, /ci, /C2) where 

• /ci, /c2 are nonnegative integers, 

• ue EMfc^+i+jk2, and 

• V ^ Vl® ■ ■ ■ ®Vn e EM„,£, with k^ki+£ + k2. 

{u, ki, V, ^2) is an L-context of an element / G EM„ if u ■ (ki © / • v © k2) G L. 
Recall that k denotes the ©-sum of k terms equal to 1. Below, when ki and /c2 
are clear from the context (or do not play any role), we will write just {u,v) 
to denote the context (u, /ci, /C2). 



For each f,g E EM„, we let / if and only if / and g have the same 

L-contexts. 

Proposition 3.1 The relation ^l, associated with a subset L of T,Mk, is a 
preclone congruence which saturates L. 

Proof. Suppose that /, /' G EM„ and g,g' G EM„^m with / ~l /', g = 
gi (B ■ ■ ■ (B gn, g' = g[ ® ■ ■ ■ (B g'n and gi ~l g- for each 1 < i < n. We prove 
that f ■ g f ■ g' ■ Let rrii be the rank of g^ and 51^, so that m = J2ie[n] 
and consider any m-ary context (u, ki,v, k-^) in EMj^. Then v G EM^ ^ with 
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£ = k — {ki + ^2). Thus. I' is an m-tuple, and we let wi be the ©-sum of the 
first mi-terms of v, W2 be the ©-sum of the following m2-terms of v, etc, until 
finally w„ is the ©-sum of the last m„ terms of v. In particular, we may write 

V = Wi (B ■ ■ ■ ® Wn- 

Since f '-^l f, we have 

ii • (ki © / ■ 5( ■ w © ka) e L <S=^ m • (ki © /' ■ 51 ■ w © ks) G L. 

It suffices to consider the ra-ary context (n,ki,gf • v,\i2), where g • v — {gi (B 
■ ■ ■ ® gn) ■ v stands ior gi ■ Wi (B ■ ■ ■ ® gn ■ 

Moreover, since g^ gl, we have 



u- (ki® f ■ (g[-wi®---® g[_^ ■ Wi^i ®gi-Wi® gi+i ■ Wi+i • • • © g'n • Wn) © ka) G L 
^u-{ki®f-{g[-wi®---®g--Wi® gi+i ■ Wi+i © • • • © ^„ • © ka) G L, 

for each 1 < i < n. To justify this statement, it suffices to consider the 
following m^-ary context (for g^ and g^), 

(u-{g[-wi®- ■ ■ g[_;i-Wi-i®l®gi+i-Wi+i®- ■ -^gn-Wn), ki + h,i, Wh lj,2+k2), 

where is the sum of the ranks of Wi, . . . , and £j^2 is the sum of the 
ranks of Wj+i, 

We now have 



M • (ki © /' • 5 • -u © k2) e L M ■ (ki © / ■ (^1 ■ ivi ®---®gn-Wn)®\^2)eL 

• (ki © / • (yi • Wl © • • • © • Wn) © k2) e L 

■ (ki © / ■ ■ © ■ ■ ■ © • w„) © k2) e L 
ti • (ki © / • 5(' • V © k2) e L. 

This completes the proof that ~l is a congruence. Next we observe that an 
element / G SM^ is in L if and only if the fc-ary context (1, k) is an L-contcxt 
of /: it follows immediately that saturates L. □ 



We denote by {M^, Sj^) the quotient p^f-pair {T^M/r^i, T,/r^]^), called the syn- 
tactic pg-pair of L. Ml is the syntactic preclone of L and the projection mor- 
phism TjL' SM — > Ml, or rjL' (SM, E) — > {Ml, Sl), is the syntactic morphism 
of L. We note the following, expected result. 

Proposition 3.2 The syntactic congruence of a subset L ofT,Mk is the coars- 
est preclone congruence which saturates L. A preclone morphism a: EM — > S 
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(resp. a morphism of pg -pairs a: (SAf, S) {S,A)) recognizes L if and only 
if a can be factored through the syntactic morphism 7]^ . In particular, L is 
recognizable if and only if is locally finite, if and only if is finitary. 

Proof. Let ~ be a congruence saturating L and assume that f,g& SM„ are 
~-equivalent. Let {u, ki, v, ^2) be an n-ary context: then u ■ (ki (B f -v (B ~ 
u ■ (ki Q) g ■ V Q) k2), and since ~ saturates L, u ■ (ki © / ■ f © k2) G L iff 
u ■ (ki ffi 5^ • u © k2) e L. Since this holds for all n-ary contexts in SM^, it 
follows that f 9- n 



We also note that syntactic preclones are finitely determined. 

Proposition 3.3 The syntactic preclone of a subset L ofEM^ is k-determined. 

Proof. We show that ii f,g E SM„ and / • /i ~l • /i for all /i e SM„^£ with 
i < k, then / ~i g, that is, / and g have the same L-contexts. 

Let {u, ki, V, /C2) be an L-context of /. Note in particular that v G SM^^^p with 
k — ki + p + k2. It follows that f -v E SMp, and that f ■ v g ■ v. Moreover 
(u, /ci, p, k2) is an L-context oi f • v. But in that case, (w, ki, p, k^) is also an 
L-contcxt o{ g-v, and hence {u, ki, v, ko) is an L-context of g, which concludes 
the proof. □ 



3.2 The usual notion of regular tree languages 



We now turn to tree languages in the usual sense, that is, subsets of EAfo- For 
these sets, there exists a well-known notion of (bottom-up) tree automaton, 
whose expressive power is equivalent to monadic second-order definability, to 
certain rational expressions, and to recognizability by a finite E-algebra [23,24] 
(see Section 2.3.1). The tree languages captured by these mechanisms are said 
to be regular. It is an essential remark (Theorem 3.4 below) that the regular 
tree languages are exactly the subsets of EMq that are recognized by a finitary 
preclone. 

Recall that the minimal tree automaton of a regular tree language is the 
least deterministic tree automaton accepting it, and the S-algebra associated 
with this automaton is called the syntactic T,-algebra of the language. It is 
characterized by the fact that the natural morphism from the initial E-algebra 
to the syntactic E-algebra of L, factors through every morphism of E-algebra 
which recognizes L (see Section 2.3.1 and [23,24,1]). 

Theorem 3.4 A tree language L C EMq is recognizable if and only if it is 
regular. Moreover, the syntactic preclone (resp. pg-pair) of L is the preclone 



17 



(resp. pg-pair) associated with its syntactic 'E-algebra. 

Proof. Let Q be the syntactic S-algebra of L, let {S, A) be its syntactic pg- 
pair, and let rj: (EM, E) — > {S, A) be its syntactic morphism. As discussed in 
Section 2.3.1, the pg-pair associated with Q, pg{Q), recognizes L, and hence 
the syntactic morphism of L factors through an onto morphism of pg-pairs 
P&iQ) ~^ {S,A). In particular, if L is regular, then Q is finite, so pg{Q) is 
finitary, and so is {S, A): thus L is recognizable. 

Conversely, assume that L is recognizable. Since (S, A) is finitary and 0- 
determined (Proposition 3.3), so {S, A) is isomorphic to a sub-p^f-pair of T(Sq) 
by Proposition 2.13. Using again the discussion in Section 2.3.1, Sq has a nat- 
ural structure of S-algebra (via the morphism r]), such that {S.A) = pg(S'o) 
and such that 5*0 recognizes L as a S-algebra. In particular, L is recognized 
by a finite E-algebra, and hence L is regular. 

Moreover, the recognizing morphism EMq Sq is the restriction to EMq 
of 1], the syntactic morphism of L. Therefore there exists an onto morphism 
of E- algebras Sq ^ Q, which in turn induces a morphism of preclones from 
{S,A) = pg(S'o) onto pg{Q). Since Q and So are finite, it follows that the 
morphisms between them described above are isomorphisms, and this implies 
that pg{Q) is isomorphic to {S,A). O 

While not difficult. Theorem 3.4 is important because it shows that we are not 
introducing a new class of recognizable tree languages. We are simply associ- 
ating with each regular tree language a finitary algebraic structure which is 
richer than its syntactic E-algebra (a.k.a. minimal deterministic tree automa- 
ton). This theorem also implies that the syntactic pg-pair of a recognizable 
tree language has an effectively computable finite presentation. 

Remark 3.5 If L C EMq, the definition of the syntactic congruence of L 
involves the consideration of n-ary contexts in EMq. Such contexts are neces- 
sarily of the form (u, 0,v,0), where u e EMi and v e T,Mn,o, which somewhat 
simplifies matters. □ 



3.3 More examples of recognizable tree languages 

The examples in this section are directly related with the preclones discussed in 
Section 2.3.2. Let A be a ranked Boolean alphabet, that is, a ranked alphabet 
such that each A„ is either empty or equal to {0„, 1„}, and Aq and at least 
one A„ (n > 2) are nonempty. Let A; > be an integer. 



18 



Verifying the occurrence of a letter Let Kk{3) be the set of aU trees in 
AMfc containing at least one vertex labeled 1„ (for some n). Then ^^^(3) is 
recognizable, by a morphism into the preclone Tg (see Example 2.5). 

Let a: AM T3 be the morphism of preclones given by a(0„) = or„ (q;(0o) = 
falseo) and a(l„) = true„ whenever A„ 7^ 0. It is not difficult to verify that 
«^^(truefe) = Kk{3). Moreover, a{A) contains a generating set of T3, so a is 
onto, and the syntactic morphism of Kk(3) factors through a. But Tg has at 
most 2 elements of each rank, so any proper quotient M of Tg has exactly 
one element of rank n for some integer n. One can then show that M cannot 
recognize Kk{3). Thus the syntactic pg'-pair of Kk{3) is (Tg, a{A)). 

If S is any ranked alphabet such that Sq and at least one S„ (n > 1) is 
nonempty, if E' is a proper nonempty subset of E, and ii'fc(E') is the set of 
all trees in E'M^ containing at least a node labeled in E', then ^^(E') too 
has syntactic preclone T3. The verification of this fact can be done using a 
morphism from EM to AM, mapping each letter a of rank n to 1„ if it is in 
E', to 0„ otherwise. 



Counting the occurrences of a letter Let p, r be integers such that 
< r < p and let K^^^p) consist of the trees in AM^ such that the number 
of vertices labeled 1„ (for some n) is congruent to r modulo p. Then Kk{3p) 
is recognizable, by a morphism into the preclone Tp (see Example 2.6). 

Let indeed a: AM Tp be the morphism given by a(0,„) = /„^o ct(ln) = 
fn,i whenever A„ 7^ 0. Then one verifies that a'^{fk,r) = -^'fc(3p). Moreover, 
Oi{A) contains a generating set of Tp, so a is onto, and the syntactic morphism 
of Kk(3p) factors through a. An elementary verification then establishes that 
no proper quotient of Tp can recognize ^"^(3^), and hence the syntactic pg-pair 
oiKk{3;) is {Tp,a{A)). 

As above, this can be extended to recognizing the set of all trees in EMj. 
where the number of nodes labeled in some proper nonempty subset E' of E 
is congruent to r modulo p. 

Using the same idea, one can also handle tree languages defined by counting 
the number of occurrences of certain letters modulo p threshold q. It suffices to 
consider, in analogy with the mod p case, the languages of the form Kk{3p^^), 
and the preclone Tp^q, a sub-preclone of T(Bp+g), whose rank n elements are 
the mappings Z^: (ri, . . . , r„) ^-^ ri + ■ ■ ■ + r„ + r, where the sum is taken 
modulo p threshold q. Note that this notion generalizes both above examples, 
since Tp — Tp^o and that T^ — Ti^i. 
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Identification of a path Let -ft'fc(path) be the set of all the trees in AM^ 
such that all the vertices along at least one maximal path from the root to a leaf 
arc labeled 1„ (for the appropriate values of n). Then /^^(path) is recognized 
by the preclone Tpath (see Example 2.7). 

Let indeed a: AM Tpath be the morphism given by «(0„) = false„, a(lo) = 
trueo and q;(1„) = or„ (n ^ 0). One can then verify that Q;~^(truefc) — 
i^fe(path). 



Identification of the next modality Let Kk{nex.t) consist of all the trees 
in AMk such that each maximal path has length at least two and the children 
of the root are labeled 1„ (for the appropriate n). We show that Kk{next) is 
recognizable. 

RecaU that B = {true, false}, and let a: AM T{M x B) be the morphism 
given as follows: 

• a(Oo) is the nuUary constant (false, false) o, 

• a{lo) is the nuUary constant (false, true)o, 

• if n > 0, then a(0„) is the n-ary map ((xi, yi),..., (.t„, ?/.„)) {AiUi, false) 

• if n > 0, then a{ln) is the n-ary map ((xi, yi), . . . , (x„, yn)) ^ {Kyi, true). 

One can verify by structural induction that for each element x e AM^, the 
second component of a{x) is true if and only if the root of x is labeled 1„ for 
some n, and the first component of a{x) is true if and only if every child of the 
root of X is labeled 1„ for some n. that is, if and only if a; £ /^^(next). Thus 
Kk{next) is recognized by the morphism a. 



4 Pseudov£irieties of preclones 

In the usual setting of one-sorted algebras, a pseudovariety is a class of finite 
algebras closed under taking finite direct products, sub-algebras and quotients. 
Because we are dealing with preclones, which are infinitely sorted, we need to 
consider finitary algebras instead of finite ones, and to adopt more constraining 
closure properties in the definition. (We discuss in Remark 4.18 an alternative 
approach, which consists in introducing stricter finiteness conditions on the 
preclones themselves, namely in considering only finitely generated, finitely 
determined, finitary preclones.) 

We say that a class of finitary preclones is a pseudovariety if it is closed under 
finite direct product, sub-preclones, quotients, finitary unions of a;-chains and 
finitary inverse limits of a;-diagrams. Here, we say that a union T = Un 
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of an a;-chain of preclones T^"^ n > is finitary exactly when T is finitary. 
Finitary inverse limits lim^T^"^) of a;-diagrams </7„:r("+^) T^"'\ n > are 
defined in the same way. 

Remark 4.1 To be perfectly rigorous, we actually require pseudovarieties to 
be closed under taking preclones isomorphic to a finitary a;-union or to a 
finitary inverse limit of an a;-diagram of their elements. □ 



Remark 4.2 Recall that the inverse limit T of the cu-diagram {^pn)n>o, writ- 
ten T = lim„ T*^"^ if the (pn- T*^""*"^^ —>■ T*^") are clear, is the sub-preclone of 
the direct product On whose set of elements of rank m consists of those 
sequences (x„)„>o with Xn G such that = x„, for all n > 0. We 

call the coordinate projections tt^ : lim„ T*^"^ — > T^^ the induced projection 
morphisms. 




■ ■ ■ ► «► • • • «► 7^(0) 

The inverse limit has the following universal property. Whenever S* is a pre- 
clone and the morphisms V^n^ 5* T*^") satisfy ipn = V'n o V'n+i for each n > 0, 
then there is a unique morphism ip: S ^ lini„ T*^") with Tin ° ^ = ^n, for all n. 
This morphism maps an element s & S to the sequence {ipn{s))n>o- ^ 



Example 4.3 Here we show that the inverse hmit of an a;-diagram of 1-gen- 
erated finitary preclones needs not be finitary. Let E = {o"}, where a has rank 
1 and consider the free preclonc SM. Note that EM has only elements of 
rank 1, and that SMi can be identified with the monoid a*. For each n > 0, 
let Kin be the congruence defined by letting a'' o"^ if and only if k = i, 
or k,£ > n. Let T^") = EM/Ri„. Then T^") is again cr-generated, and it can 
be identified with the monoid {0,1,..., n} under addition threshold n. In 
particular, T*^") is a finitary preclone. Since ?5i„+i-equivalent elements of SM 
are also ~„-cquivalent, there is a natural morphism of preclones from T'^"^^^ 
to T*-") , mapping a to itself, and the inverse limit of the resulting a;-diagram 
is EM itself, which is not finitary. □ 



Pseudovarieties of preclones can be characterized using the notion of division: 
we say that a preclone 5" divides a preclone T, written 5" < T, if 5" is a quotient 
of a sub-preclone of T. It is immediately verified that a nonempty class of 
finitary preclones is a pseudovariety if and only if it is closed with respect to 
division, binary direct product, finitary unions of a;-chains and finitary inverse 
limits of a;-diagrams. 
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Example 4.4 It is immediate that the intersection of a collection of pseu- 
dovarieties of preclones is a pseudovariety. It follows that if K is a class of 
finitary preclones, then the pseudovariety generated by K is well defined, as 
the least pseudovariety containing K. In particular, the elements of this pseu- 
dovariety, written (K), can be described in terms of the elements of K, taking 
subpreclones, quotients, direct products, finitary unions of a;-chains and in- 
verse limits of a;-diagrams. See Section 4.2 below. 

We discuss other examples in Section 5.2. □ 

We first explore the relation between pseudovarieties and their finitely de- 
termined elements, then we discuss pseudovarieties generated by a class of 
preclones, and finally, we explore some additional closure properties of pseu- 
dovarieties. 



4-1 Pseudovarieties and their finitely determined elements 
Proposition 4.5 Let S be a preclone. 

(1) S is isomorphic to the inverse limit liuin S^'^^ of an uj -diagram, where each 

S'*'"-' is an n-determined quotient of S. 

(2) If S is finitary, then S is isomorphic to the union of an uo-chain Un>o^*'"''; 
where each T^^^ is the inverse limit of an cu-diagram of finitely generated, 
finitely determined divisors of S. 

Proof. Let 5'*^"-' = S/^n (where ~„ is defined in Section 2.4) and let 7r„: S 
S^'^^ be the corresponding projection. Since ~„+i-related elements of S are 
also ~„-related, there exists a morphism of preclones </?„: S^'^'^^^ — > -S^") such 
that 7r„ = o TTn+i- Thus the tt^ determine a morphism n: S ^ lim„5'*^"\ 
such that 7r(s) = (7r„(s))„ for each s e S (Remark 4.2). 

Moreover, since ~„ is the identity relation on the elements of S of rank at 
most n, we find that for each k < n, TCn establishes a bijection between the 
elements of rank k oi S and those of S^'^\ In particular, tt is injective since 
each element of S has rank k for some finite integer k. Furthermore, for each 
k < n, (pn establishes a bijection between the elements of rank k, and it follows 
that each element of rank k of lim„ S^'^^ is the vr-image of its k-th component. 
That is, TT is onto. Finally, Lemma 2.11 shows that each 5"*^") is n-determined. 
This concludes the proof of the first statement. 

We now assume that S is finitary, and we let T*^™") be the sub-preclone gener- 
ated by the elements of S of rank at most m. Then T^"^^ is finitely generated, 
and the first statement shows that T^"*) is the inverse hmit of an a;-diagram 
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of finitely generated, finitely determined quotients of T^'^\ which are in par- 
ticular divisors of S". □ 



The following corollary follows immediately. 

Corollary 4.6 Every pseudovariety of preclones is uniquely determined by its 
finitely generated, finitely determined elements. 

We can go a little further, and show that a pseudovariety is determined by 
the syntactic preclones it contains. 

Proposition 4.7 Let S be a finitely generated, k-determined, finitary pre- 
clone, let A be a finite ranked set and let ip: AM S be an onto morphism. 
Then S divides the direct product of the syntactic preclones of the languages 
ip~^{s), where s runs over the (finitely many) elements of S of rank at most 
k. 

Proof. It suffices to show that if x, |/ e AMn for some n > and x ~<^-i(s) y 
for each s & Si, i < k, then (p{x) — (p{y). 

First, suppose that x and y have rank n < k, and let s = ip{x). Then (1, 0, n, 0) 
is a <^~^(s)-context of x, so it is also a <^~^(s)-context of y, and we have 
ip{y) — s — ^{x). Now, if X and y have rank n > k, let v & Sn,p for some 
p < k. Since (f is onto, there exists an element z e AM^^p such that (p{z) — v. 
For each s & S^, i < k, we have x y, and hence also x ■ z ~(^-i(s) y ■ z. 

The previous discussion shows therefore that •^[x ■ z) = ip{y ■ z), that is, 
(p{x) ■ V = ip{y) ■ V. Since S is fc-determined, it follows that ip{x) = (p{y). □ 

Corollary 4.8 Every pseudovariety of preclones is uniquely determined by 
the syntactic preclones it contains. 

Proof. This follows directly from Corollary 4.6 and Proposition 4.7. □ 



4.2 The pseudovariety generated by a class of preclones 

Let I, H, S, P, L, U denote respectively the operators of taking all isomorphic 
images, homomorphic images, subpreclones, finite direct products, finitary in- 
verse limits of an a;-diagram, and finitary a;-unions over a class of finitary 
preclones. The following fact is a special case of a well-known result in univer- 
sal algebra. 

Lemma 4.9 If K. is a class of finitary preclones, then HSP(K) is the least 
class of finitary preclones containing K., closed under homomorphic images. 
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subpreclones and finite direct products. 

Next, we observe the following elementary facts. 

Lemma 4.10 For all classes K of finitary preclones, we have 

(1) PL(K) C LP(K), (2) PU(K) C UP(K), 

(3) SL(K) C LS(K), (4) SU(K) C US(K). 

Proof. To prove the first inclusion, suppose that S is the direct product of 
the finitary preclones S^'^\ i e [n], where each 5"^*^ is a limit of an a;-diagram 
of preclones S^'^'^^ in K determined by a family of morphisms ipi^k'- S^^''^'^^^ — > 
S^''''\ k >0. For each k, let T^*^) be the direct product IlieN -^^''^^ 
(fk = Yli(z[n] ^i,k- T^'^'^^^ It is a routine matter to verify that S is iso- 

morphic to the limit of the a;-diagram determined by the family of morphisms 
^ T^'^), A; > 0. Thus, S e LP(K). 

Now, for each i E [n], let {S^^'''^)k>o be an cj-chain of finitary preclones in K. 
Let us assume that each S**^*-* = Ua:>o 'S'*'*''^'' is finitary, and let S = Ylie[n] 'S'*-*^- 
If s = (si, . . . , Sn) G S, then each Sj belongs to S^^'''^\ for some /cj. Thus s e 
Ylie[n] S^^'''\ where k — maxki, and we have shown that 5" e Ufc>onie[n] •5'*^*'*^^ 
so that 5 e UP(K). 

To prove the third inclusion, let T be a sub-preclone of lim„ S^^\ the finitary 
inverse limit of an a;-diagram (pn- S^"'^^^ 5*'-"^ of elements of K. Let 7r„: T 
be the natural projections (restricted to T), and let T(") = 7r„(r). Then 

Tin) 

is a subpreclone of 5"^") for each n. Moreover, the restrictions of the 
to T("+i) define an cij-diagram of subpreclones of elements of K, and it is an 
elementary verification that T = lim„T*^'^). Since T is finitary, we have proved 
that T e LS(K). 

As for the last inclusion, let T be a subpreclone of a finitary union Ufc>o S^'''' 
with Si^'> e K, for all A; > 0. Let T^'^) = S^''^ HT for each A; > 0. Then each T^'^) 
is a subpreclone of Si'''> and T = Ufe>o T^'"^- It follows that T e US(K). □ 

Our proof of the third inclusion actually yields the following result. 

Corollary 4.11 If a finitary preclone S embeds in an inverse limit lim„ Si'^\ 
then S is isomorphic to a (finitary) inverse limit lim„T*^"'), where each T*^") is 
a finitary sub-preclone of Si'^^ . 

We can be more precise than Lemma 4.10 for what concerns finitely generated, 
finitely determined preclones. 

Lemma 4.12 Let T be a preclone which embeds in the union of an cu -chain 
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{S^'^^)n- IfT is finitely generated, then T embeds in S^'^^ for all large enough 
n. 

Proof. Since T is finitely generated, its set of generators is entirely contained 
in some S''^\ and hence T embeds in each n> k. O 

Lemma 4.13 Let T be a quotient of the union of an uj-chain {S^'^^)n- IfT is 
finitely generated, then T is a quotient of S^"^^ for all large enough n. 

Proof. Let Lp he a, surjective morphism from 5* = IJn 'S'*-"-' onto T. Since T is 
finitely generated, there exists an integer k such that Lp{S^^^) contains all the 
generators of T, and this implies that the restriction of 9? to S^'^'> (and to each 
n>k)\s onto. □ 

Lemma 4.14 Let T be a preclone which embeds in the inverse limit lim„ S^'^^ 
of an uj -diagram, and for each n, let TTn-T ^ S^"'' be the natural projection 

(restricted to T). IfT is finitary, then for each k, 7r„ is k-injective for all large 
enough n. If in addition T is finitely determined, then T embeds in Sn for all 
large enough n. 

Proof. Since T is finitary, T^ is finite for each integer k, and hence there exists 
an integer such that 7r„ is injcctive on T^ for each n > n^. In particular, 
for each integer k, iTn is A;-injective for all large enough n. The last part of the 
statement follows from Lemma 2.12. □ 



Lemma 4.15 Let T be a quotient of the finitary inverse limit lim„ S^^^ of an 
uj-diagram. IfT is finitely determined, then T is a quotient of a sub-preclone 
of one of the S^^^. 

Proof. Let S = lim„ S**-"^ and let 7r„: 5* — >■ S*^"-* be the corresponding projec- 
tion. Let also (f-.S^Thean onto morphism, and let /c > be an integer such 
that T is /c-determined. By Lemma 4.14, 7r„ is /c-injective for some integer n. 

Consider the preclone 7r„(5') C S^'^\ Then we claim that the assignment 
T^nis) ^ (fiis) defines a surjective morphism 7r„(yS') — > T. The only nontrivial 
point is to verify that this assignment is well defined. Let s, s' G Sp and sup- 
pose that vr„(s) = vr„(s'). We want to show that (f{s) = ^p{s'), and for that 
purpose, we show that ip{s) ■ v = (p{s') ■ v for each v e Tp^£, £ < k (since T is 
/c-determined) . Since (f is onto, there exists w e Sp^e such that v — ^{w). In 
particular, ip{s) ■ v — (p{s ■ w) and similarly, </?(«') • v — (p{s' ■ w). Moreover, we 
have TTnis-w) — nn{s' -w). Now s-w and s' -w lie in Se, and tt^ is injective on Se, 
so s • w — s' ■ w. It follows that ip{s) ■ v — (p{s') ■ v, and hence (p{s) — (p{s'). □ 
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We are now ready to describe the finitely generated, finitely determined ele- 
ments of the pseudovariety generated by a given class of finitary preclones. 

Proposition 4.16 Let K. be a class of finitary preclones. A finitely generated, 
finitely determined, finitary preclone belongs to the pseudovariety (K) gener- 
ated by K if and only if it divides a finite direct product of preclones in K., 
i.e., it lies in HSP(K). 

Proof. It is easily verified that (K) = (Jn^n, where Vq = K and V„+i = 
HSPUHSPL(Vn). We show by induction on n that if T a finitely generated, 
finitely determined preclone in V„, then T e HSP(K). 

The case n = is trivial and we now assume that T G V„+i. By Lemma 4.10, 
T hes in HUSPHLSP(V„). Then Lemma 4.13 shows that T is in fact in 
HSPHLSP(V„), which is equal to HSPLSP(V„) by Lemma 4.9, and is 
contained in HLSP(V„) by Lemma 4.10 again. Now Lemma 4.15 shows that T 
lies in fact in HSP(V„), and we conclude by induction that T e HSP(K). □ 

Corollary 4.17 //K is a class of finitary preclones, then (K) = lULHSP(K). 

Proof. The containment lULHSP(K) C (K) is immediate. To show the re- 
verse inclusion, we consider a finitary preclone T G (K). Then T = [JT^^\ 
where T*^"-* denotes the subpreclone of T generated by the elements of rank 
at most n. Now each T^"-^ is finitely generated, and by Proposition 4.5, it is 
isomorphic to the inverse limit of the a;-diagram formed by the finitely gener- 
ated, finitely determined preclones m > 0. By the Proposition 4.16, 
each of these preclones is in HSP(K), so T e lULHSP(K). □ 

Remark 4.18 As indicated in the first paragraph of Section 4, Proposi- 
tion 4.16 hints at an alternative treatment of the notion of pseudovarieties 
of preclones, limited to the consideration of finitely generated, finitely deter- 
mined, finitary preclones. Say that a class K of finitely generated, finitely 
determined, finitary preclones is a relative pseudovariety if whenever a finitely 
generated, finitely determined, finitary preclone S divides a finite direct prod- 
uct of preclones in K, then S is in fact in K. For each pseudovariety V, the 
class Vfin of all its finitary, finitely generated, finitely determined members is 
a relative pseudovariety, and the map V i— > Vfin is injective by Corollary 4.6. 
Moreover, Proposition 4.16 can be used to show that this map is onto. That 
is, the map V ^ Vfjn is an order-preserving bijective correspondence (with 
respect to the inclusion order) between pseudovarieties and relative pseudova- 
rieties of preclones. □ 

Proposition 4.16 also leads to the following useful result. Recall that a finitely 
generated preclone S is effectively given if we are given a finite generating set 
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A as transformations of finite arity of a given finite set Q, see Section 2.3.1. 

Corollary 4.19 Let S and T be effectively given, finitely generated, finitely 
determined preclones. Then it is decidable whether T belongs to the pseudova- 
riety of preclones generated by S. 

Proof. Let A (resp. B) be the given set of generators of S (resp. T) and letV 
be the pseudovariety generated by S. By Corollary 4.16, T e V if and only if 
T divides a direct power of -S", say, T < S'^. Since B is finite, almost all the 
sets Bh are empty. We claim that the exponent m can be bounded by 

n i^^i""^'- 

Indeed, there exists a sub-preclone S' C S"^ and an onto morphism S' — > T. 
Since B generates T, we may assume without loss of generality that this 
morphism defines a bijection from a set A' of generators of S' to B, and in 
particular, we may identify Bk with A'j^, a subset of A^. Next, one verifies that 
if m is greater than the bound in the claim, then there exist 1 < i < j < m 
such that for all k and x E A'l^, the i-th and the j-th components of x are 
equal — but this implies that the exponent can be decreased by 1. 

Thus, it suffices to test whether or not T divides S™", where m is given by the 
above formula. But as discussed above, this holds if and only if A™' contains a 
set A' and a rank preserving bijection from A' to B which can be extended to 
a morphism from the sub-preclone of S'^ generated by A' to T. By Proposi- 
tion 2.14, and since S and T are effectively given and T is finitely determined, 
this can be checked algorithmically. □ 



4-3 Closure properties of pseudovarieties 

Here we record additional closure properties of pseudovarieties of preclones. 

Lemma 4.20 Let Y be a pseudovariety of preclones and let T be a finitary 
preclone. IfT embeds in the inverse limit of an uj-diagram of preclones in V, 
then T e V. 

Proof. The lemma follows immediately from Corollary 4.11. □ 

Proposition 4.21 Let V be a pseudovariety of preclones and let S be a fini- 
tary preclone. If for each n > 0, there exists a morphism ipn- S S'*^"-' such 
that S^""^ e V and (fin is infective on elements of rank exactly n, then 5" e V. 

Proof. Without loss of generality we may assume that each is surjective. 
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For each n > 0, consider the direct product T^^^ = S^^^ x ■ ■ ■ x S^"-\ which is 
in V, and let ^„ denote the natural projection of T^"^^) onto T("). Let also 
ipn- S ^ T^") be the target tupling of the morphisms ^Pi, i < n, let T be the 
inverse limit hrn^T^") determined by the morphisms /Xn, and let 7r„: T — > T^") 
be the corresponding projection morphisms. 

Note that each ipn is n-injective, and equals the composite of V'n+i and /i„. 
Thus, there exists a (unique) morphism ip: S ^ T such that the composite of 
ip and 7r„ is ipn for each n. It follows from the n-injectivity of each ip^, that ip 
is injective. Thus, S embeds in the inverse limit of an a;-diagram of preclones 
in V, and we conclude by Lemma 4.20. □ 

We note the following easy corollary of Proposition 4.21. 

Corollary 4.22 Let Y be a pseudovariety of preclones. Let S he a finitary 
preclone such that distinct elements of equal rank can he separated hy a mor- 
phism from S to a preclone in V. Then S 

Proof. For any distinct elements f,g of equal rank n, let ff^g'-S — > Sf^g be a 
morphism such that Sf^g e V and 'fif,g{f) 7^ (pf^g{g). For any integer n, let (pn 
be the target tupling of the finite collection of morphisms 9?^^ with f,gE Sn- 
Then is injective on Sn and we conclude by Proposition 4.21. □ 



4.4 Pseudovarieties of pg -pairs 

The formal treatment pseudovarieties of pg-pairs is similar to the above treat- 
ment of pseudovarieties of preclones - but for the following remarks. 

We define a pseudovariety of pg -pairs to be a class of finitary pg^-pairs closed 
under finite direct product, sub-p^f-pairs, quotients and finitary inverse limits 
of cij-diagrams. Our first remark is that, in this case, we do not need to mention 
finitary unions of a;-chains: indeed, finitary pgi-pairs are finitely generated, so 
the union of an a;-chain, if it is finitary, amounts to a finite union. 

Next, the notion of inverse limit of c<j-diagrams of pg-'gaim needs some clarifica- 
tion. Consider a sequence of morphisms of pgf-pairs, say tpn- (5'*^"'"'"^-', A*^""*"^)) 
[S'^''^\ A^""^). That is, each 9?^ is a preclone morphism from S'^^^^^ to 5*'-"'^ 
which maps into ^4^"). We can then form the inverse limit hm„ ,5^") of 

the a;-diagram determined by the preclone morphisms iprn and the inverse limit 
lim„ A*^") determined by the set mappings The inverse limit lim„(5("), 
of the tu-diagram determined by the morphisms of p^f-pairs (as determined 
by the appropriate universal limit, see Remark 4.2) is the pgf-pair (5, A), where 
A — hm„ A^") and S is the subpreclone of lim„ 5'^"^ generated by A. Recall 
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that this inverse Umit is called finitary exactly when S is finitary and A is 
finite (see Example 4.3). 

We now establish the close connection between this inverse hmit and the 
inverse limit of the underlying a;-diagram of preclones, when the latter is 
finitary. 

Proposition 4.23 Let A^")) be an oo-diagram of 

pg-pairs. Let S = lim„5(") and let and (T, A) = lim„(5("), A^). If S is 
finitary, then S — T. 

Proof. We need to show that A generates S. Without loss of generality, we 
may assume that each (p^ maps A^^~^^^ surjectively onto ^4^"^ and we denote 
by Xn the restriction of to . By definition, A is the inverse limit of the 

a;-diagram given by the Xm and we denote by p„: A — > A^'^^ the corresponding 
projection. We also denote by Xn and p„ the extensions of these mappings to 
preclone morphisms A("+^)M ^("^M and AM A^'^^M. It is not difficuh 
to verify that AM is the inverse limit of the cu-diagram given by the x„, and 
that the pn are the corresponding projections. 

A AM 



Pn+l 




.^(n+i)-^ ^(n) -A("+i)M-^r;-^A(")M • • ' 

A,n An 

Moreover, each pk is onto (even from A to A^^'^). Let indeed G A'^^\ Since 
the Xn are onto, we can define by induction a sequence (a„)„>fc such that 
Xn{0'n+i) — O'n fo^ each n > k. This sequence can be completed with the 
iterated images of Uk by Xk-i, ■ ■ ■ ; Xo to yield an element of A whose k-th 
projection is ak- 

Since A^") generates S^'^\ the morphism i/j^-A'^'^^M S^'^^ induced by id^{n) 
is surjective. Moreover, the composites o and ipn ° Xn coincide. 
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It follows that the morphisms ipn°Pn' — > S^"'^ and (fn^'ipn+i^Pn+i coincide, 
and hence there exists a morphism r: AM S such that TTn°^ = "^n^ Pn for 
each n. Since pn and ipn are onto, it follows that each 7r„ is surjective. 

We now use the fact that S is finitary. By Lemma 4.14, 7r„ is /c-injective for 

each large enough n. Let now s G Sk- We want to show that s G t{AM). 
Let rik be such that 7r„ is A;-injective for each n > n^. We can choose an 
element tn^. G A^'^'^^M such that ^nki'tn^) = """nfel-s)- Then, by induction, we 
can construct a sequence (tn)n of elements such that Xn{tn+i) — tn for each 
n > 0. We need to show that '^n{tn) — T^n{s) for each n. 

This equality is immediate for n < Uk, and we assume by induction that it 
holds for some n > Uk- We have 

Since TTn and tt^+i are surjective, since they are injective on 5"^, and since 
ipn ° — T^m wc find that is injective on S^'^^\ and hence — 
7r„+i(s), as expected. 

Thus {tn)n £ AM and r(t) = s, which concludes the proof that S is generated 
by A. □ 



5 Varieties of tree languages 



Let V = (Vs,fc)s,A; be a collection of nonempty classes of recognizable tree 
languages L C SM^, where S runs over the finite ranked alphabet and k 
runs over the nonnegative integers. We call V a variety of tree languages, or 
a tree language variety, if each Vs,fe is closed under the Boolean operations, 
and V is closed under inverse morphisms between free precloncs generated by 
finite ranked sets, and under quotients defined as follows. Let L C EM^ be 
a tree language, let ki and k2 be nonnegative integers, u G SM^j+i+^j and 
V G SM^^fc. Then the left quotient {u, ki, k2)^^L and the right quotient Lv~^ 
are defined by 

{u, ki, k2y^L = {t e T.Mn I n ■ (ki © t © G L} where k = ki + n + k2 
Lv~^^{t G EM„ \ t-v e L}, 

that is, (m, fci, k2)^^L is the set of elements of SM„ for which {u, ki, n, ^2) is 
an L-context, and Lv^^ is the set of elements of SM„ for which (1,0, f,0) is 
an L-context. Below we will write just u~^L for (m, ki, k2)~^L if ki and k2 are 
understood, or play no role. 
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A literal variety of tree languages is defined similarly, but instead of closure un- 
der inverse morphisms between finitely generated free preclones. we require clo- 
sure under inverse morphisms between finitely generated free pg-pa-irs. Thus, 
if L C EMfc is in a literal variety V and </?: AM — > EM is a preclone morphism 
with E, A finite and </^(A) C E, then (p~^{L) is also in V. 



5. 1 Varieties of tree languages vs. pseudovarieties of preclones 

The aim of this section is to prove an Eilenberg correspondence between pseu- 
dovarieties of preclones (resp. pg^-pairs), and varieties (resp. literal varieties) 
of tree languages. For each pseudovariety V of preclones (resp. pg-pairs), 
let var(V) = (Vs.fc)s.fc! where Vs.fc denotes the class of the tree languages 
L C EMfc whose syntactic preclone (resp. pg-pa.ii) belongs to V. It follows 
from Proposition 3.2 that var(V) consists of all those tree languages that can 
be recognized by a preclone (resp. pg^-pair) in V. 

Conversely, if W is a variety (resp. a literal variety) of tree languages, we 
let psv(>V) be the class of all finitary preclones (resp. pg-paiis) that only 
accept languages in W, i.e., a~^{F) C EM^ belongs to W, for all morphisms 
a: EM ^ ,5 (resp. a: (EM, E) ^ {S, A)), k > and F C Sk- 

Theorem 5.1 The mappings var and psv are mutually inverse lattice isomor- 
phisms between the lattice of pseudovarieties of preclones (resp. pg-pairs) and 
the lattice of varieties (resp. literal varieties) of tree languages. 

Proof. We only prove the theorem for pseudovarieties of pg-paiis and literal 
varieties of tree languages. It is clear that for each pseudovariety V of finitary 
p^f-pairs, if var(V) = (Vs,fe)E,fc, then each Vs,fe is closed under complementation 
and contains the languages and EM^. The closure of VE,fe under union follows 
in the standard way from the closure of V under direct product: if L, L' C SM^ 
are recognized by morphisms into p^f-pairs (S*, A) and (S", A') in V, then LUL' 
is recognized by a morphism into {S,A) x {S',A'). Thus Vs,fc is closed under 
the Boolean operations. 

We now show that V is closed under quotients. Let L C SM^ be in VE,fc, 
let a: (EM, S) — > {S, A) be a morphism recognizing L with {S, A) G V and 
L = a~^a{L), and let F = a{L). Let {u, ki,v, be an n-ary context, that 
is, u e EMfc^+i+fc2, V e EM„_^ and ki + £ + k2 = k. Now let F' = {/ e 
Se I a{u) ■ (ki © / © ks) e F}. Then for any t E EM^, a{t) e F' if and 
only if a{u) ■ (ki © a{t) © k2) G F, if and only if a{u ■ (ki © t © k2)) G F 
iff u ■ (ki © t © k2) G L. Thus, a^^{F') = {u, ki, k2)~^L, which is therefore 
in Ve,^. Now let F" = {/ G M„ : / • a{v) G L}. It follows as above that 
Lv-'^ = a-^{F") and hence Lv''^ G Vs,n- 
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Before we proceed, let us observe that we just showed the following: if L C 
EMfc is a recognizable tree language, then for each n > there are only 
finitely many distinct sets of the form ((«, ki, k2)~^ L)v~^ , where {u, ki, v, ^2) 
is an n-ary context of EM^.. 

Next, let (/?: (EM, E) (AM, A) be a morphism of pg-pairs and L C AM^. 
If L is recognized by a morphism a: (AM, A) {S,A), then Lp^'^{L) is rec- 
ognized by the composite morphism (p o a, and the closure of V by inverse 
morphisms between free pg^-pairs follows immediately. Thus the mapping var 
does associate with each pseudovariety of pgi-pairs a literal variety of tree 
languages, and it clearly preserves the inclusion order. 

Now consider the mapping psv: we first verify that if >V is a hteral variety 
of tree languages, then the class psv(>V) is a pseudovariety. Recall that, if 
{S,A) < {T,B), then any language recognized by {S,A) is also recognized 
by {T,B), so if each language recognized by (T,B) belongs to W, then the 
same holds for {S,A). Note also that any language recognized by the direct 
product {S, A) X (T, B) is a finite union of intersections of the form L n M, 
where L is recognized by {S,A) and M by {T,B)] thus psv(W) is closed 
under binary direct products. Finally, if (5", A) — lim„(S'^"\ A^")) is the finitary 
inverse limit of an c^^-diagram of finitary p(?-pairs, then Lemma 4.14 shows 
that the languages recognized by (5, A) are recognized by almost all of the 
(^(n)^^(n)-) rpj^^g ^g^^^^ ^ psv(yy), which concludcs the proof that psv(>V) is 
a pseudovariety of pg^-pairs. 

Let W be a literal variety of tree languages, and let V = var(psv(yV)). We now 
show that V = W. Since V consists of all the tree languages recognized by a 
pgf-pair in W = psv(>V), it is clear that V C >V. Now let L e Ws,fc, and let 
{Ml,Al) be its syntactic p^f-pair. To prove that {Ml,Al) G W, it suffices to 
show that if a: (EM, E) (M^, Ai) is a morphism of pg-paXis and x G M^, 
then a^^{x) G W. Since a morphism of p^f-pairs maps generators to generators, 
up to renaming and identifying letters (which can be done by morphisms 
between free p^f-pairs), we may assume that a is the syntactic morphism of L. 
Thus a'~^{x) is an equivalence class [w\ in the syntactic congruence of L, and 
hence 

a-\x)^ n {{uMM)~^L)v-' 

n fi {{u,k^,k2)-^L)v-^ 

where L denotes the complement of L. If x has rank n, the intersections in 
this formula run over n-ary contexts (m, /ci, f , A;2), and as observed above, these 
intersections are in fact finite. It follows that that a~^{x) G V. This concludes 
the verification that V = W, so var o psv is the identity mapping, and in 
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particular var is surjective and psv is injective. 

It is clear that both maps var and psv preserve the inclusion order. In order to 
conclude that they are mutually reciprocal bijections, it suffices to verify that 
var is injective. If V and W arc pseudovarieties such that var(V) = var(W) = 
V, then a tree language is in V if and only if its syntactic preclone is in V, if 
and only if its syntactic preclone is in W. Thus V and W contain the same 
syntactic preclones, and it follows from Corollary 4.8 that V = W. □ 

Remcirk 5.2 Three further variety theorems for finite trees exist in the lit- 
erature. They differ from the variety theorem proved above since they use a 
different notion of morphism, quotient, and syntactic algebra. The variety the- 
orem in [1,35] is formulated for tree language varieties over some fixed ranked 
alphabet and the morphisms are homomorphisms between finitely generated 
free algebras, whereas the "general variety theorem" of [36] allows for tree 
languages over different ranked alphabets and a more general notion of mor- 
phism, closely related to the morphisms of free pg-pairs. On the other hand, 
the morphisms in [19] arc much more general than those in either [1,35,36] or 
the present paper, they even include nonlinear tree morphisms that allow for 
the duplication of a variable. Another difference is that the tree language va- 
rieties in [1,35,36] involve only left quotients, whereas the one presented here 
(and the varieties of [19]) are defined using two sided quotients. The notion of 
syntactic algebra is also different in these papers: minimal tree automata in 
[1,35], a variant of minimal tree automata in [36], minimal clone (or Lawvere 
theory) in [19], and minimal preclone, or pg-pair, here. We refer to [19, Section 
14] for a more detailed comparative discussion. 

As noted above, the abundance of variety theorems for finite trees is due 
to the fact that there are several reasonable ways of defining morphisms and 
quotients, and a choice of these notions is reflected by the corresponding notion 
of syntactic algebra. No variety theorem is known for the 3-sorted algebras 
proposed in [41]. □ 



5.2 Examples of varieties of tree languages 
5.2.1 Small examples 

As a practice example, we describe the variety of tree languages associated 
with the pseudovariety (Tg) generated by Tg (see Section 2.3.2). 

Let S be a finite ranked alphabet and let L C SM^ be a tree language accepted 
by a preclone in (T3). Then the syntactic preclone S oi L lies in (T3). Recall 
that a syntactic preclone is finitely generated and finitely determined: it follows 
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from Proposition 4.16 that S divides a product of a finite number of copies of 
T3. By a standard argument, L is therefore a (positive) Boolean combination 
of languages recognized by a morphism from EM to Tg. 

Now let r: EM — > Tg be a morphism. As discussed in Section 3.3, a tree 
language in EM recognized by r is either of the form fCfc(E') for some E' C E, 
or it is the complement of such a language. From there, and using the same 
reasoning as in the analogous case concerning word languages, one can verify 
that a language L e EM^ is accepted by a preclone in (T3) if and only if 
L is a Boolean combination of languages of the form -^^^(E') (E' C E), or 
equivalently, L is a Boolean combination of languages of the form Lfc(E'), 
E' C E, where Lfc(E) is the set of all E-trees of rank k, for which the set of 
node labels is exactly E'. 

Similarly - and referring again to Section 3.3 for notation - one can give a 
description of the variety of tree languages associated with the pseudovariety 
(Tp), or the pseudovariety (Tp^), using the languages of the form Kk{3p) or 
KkiB'pJ instead of the Kk{3).' 

5.2.2 FO[Succ]-definable tree languages 

In a recent paper [3], Benedikt and Segoufin considered the class of F(9[Succ]- 
definable tree languages. Note that the logical language used in FO[Succ] does 
not allow the predicate <, and FO[Succ] is a fragment of F0[<]. Wc refer 
the reader to [3] for precise definitions, and we point out here that the char- 
acterization established there can be expressed in the framework developed in 
the present paper. 

More precisely, the results of Benedikt and Segoufin establish that FO[Succ]- 
definable tree languages form a variety of languages, and that the correspond- 
ing pseudovariety of preclones consists of the preclones S such that 

(1) the semigroup 5*1 satisfies x''' = x^^^ and exfyezf = ezfyexf for all ele- 
ments e, /, X, y, z such that e = and f = P and for ^ = \Si\\ 

(2) for each x e 6*2, e & Si such that e = e^, and s, t e ^o, x ■ {e ■ s ® e ■ t) ~ 
X ■ {e -t® e ■ s). 

In particular, FO[Succ]-definability is decidable for regular tree languages. 

It is clearly argued in [3] that FO[Succ]-definable tree languages are exactly 
the locally threshold testable languages, for general model-theoretic reasons, 
but that this fact alone does not directly yield a decision procedure. The result 
stated above is analogous to the characterization of FO[Succ] -definability 
for recognizable word languages - more precisely. Condition (1) suffices for 
languages of words and their syntactic semigroups. Condition (2), which makes 
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sense in trees but not in words, must be added to the other one to characterize 
FO[Succ]-definabihty for tree languages. 

5.2.3 Some classes of languages definable in modal logic 

Bojanczyk and Walukiewicz also characterized interesting logically defined 
classes of tree languages [5] . Again, their results are not couched in terms of 
preclones, but they can conveniently be expressed in this way. 

These authors consider three fragments of CTL*: TL(EX), TL(EF) and TL(EX+ 
EF). Here EX (resp. EF) denotes the modality whereby a tree t satisfies EX<^ 
(resp. EF<^) if some child of the root (resp. some node properly below the 
root) of t satisfies (p. The set of formulas constructed using one or both of 
these modalities, plus Boolean operations and letter constants form the logi- 
cal languages rL(EX), rL(EF) and rL(EX + EF). 

Bojanczyk and Walukiewicz first observe that a tree language L is TL(EX)- 
definable if and only if there exists an integer k such that membership of a 
tree t in L depends only on the fragment of t consisting of the nodes of depth 
at most k. They then show that these tree languages form a variety, and the 
corresponding pseudovariety of preclones consists of the preclones S such that 
the semigroup Si satisfies ex — e for each idempotent e. Note that this is 
exactly the same characterization as for languages of finite words [31]. 

For the characterization of TL (EF)-definable languages, let us first define the 
following relation on a preclone 5" : if s, t G we say that s :< t ii s — u-t for 
some e -Si. It is easily verified that ^ is a quasi-order. (The direction of the 
order is reversed from that used by Bojanczyk and Walukiewicz, to enhance 
the analogy with the TZ- and >C-orders in semigroup theory). 

Now let {S, A) be the syntactic pg^-pair of a tree language L C EMq. Then L 
is TL( EF)-definable if and only if 

• 5*1 satisfies the pseudo-identity v{uv)'^ = (uv)'^ (where x'^ designates the 
unique idempotent which is a power of x); that is, is £-trivial, and 
equivalently, the relation :< is an order relation; 

• a • (si ® • • • ® s„) = a ■ (.Sjr(i) © ■ ■ ■ © s^(n)) for each a e An and si, . . . , s„ e -So, 
and for each permutation tt of [n]; 

• a ■ (si © S2 © S3 © ■ ■ • © s„) = a • (s2 © S2 © S3 © • • • © -s„) for each a e A^ 
and Si, . . . , Sn & Sq such that S2 Si, 

• if 6, c e and y e Sp^o are such that, for each d e Ap, we have d • {h • y ® 
■■■®h-y)— d-y = d-{c-y®---®c-y), then a ■ {z ®h ■ y) — a ■ {z ® c ■ y) 
for each a e A„ and z e Sn-i^. 

This characterization directly imphes the decidability of TL (EF)-definability. 
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Bojanczyk and Walukiewicz also give an interesting characterization of the 
TL(EF)-dcfinable languages in terms of so-called type dependency. In partic- 
ular, they show that a tree language is TL(EF)-definable if and only if its 
syntactic preclone S is such that, whenever a is the syntactic equivalence 
class of a letter in E„, and the tiS are syntactic equivalence classes of trees in 
EMo, then the value of a product a ■ (ti © • • • ® i„) depends only on a and on 
the set {t \ ti for some 1 <i < n}. 

The characterization of TL{EX + EF)-definable languages given in [5] can also 
be restated in similar - albeit more complex - terms. 



5.2.4 FO[<]-definable tree languages 

The characterization and decidability of FO[<]-definable regular tree lan- 
guages is an open problem that has attracted some efforts along the years, 
as discussed in the introduction. 

We obtained an algebraic characterization of FO[<]-definable regular tree 
languages in terms of pseudovarieties of preclones, as is reported in [20]. A 
detailed report of this result will appear in [21], and the present paper lays 
the foundations for this proof. 

Let us note here that this characterization is analogous to the characteriza- 
tion of F0[<] -definable languages of finite words in the following sense: it 
is established in [21] that F0[<] -definable tree languages form a variety of 
tree languages, whose associated pseudovariety of preclones is the least pseu- 
dovariety containing the preclone Tg and closed under a suitable notion of 
block product. It was pointed out in Example 2.5 that the rank 1 elements 
of Tg form the 2-element monoid Ui = {1,0}, and it is a classical result of 
language theory that the least pseudovariety of monoids containing Ui and 
closed under block product is associated with the variety of FO[<]-definable 
word languages [37]. 

It is also known that, in the word case, this pseudovariety is exactly that of 
aperiodic monoids, and membership in it is decidable, which shows that F0[< 
]-definability is decidable for recognizable word languages. At the moment, we 
do not have an analogue of this result, and we do not know whether F0[<]- 
definability is decidable for regular tree languages. 

Our result [20,21] actually applies to a larger class of logically defined regular 
tree languages, based on the use of Lindstrom quantifiers. First-order logic is 
thus a particular case of our result, which also yields (for instance) an algebraic 
characterization of first-order logic with modular quantifiers added. 
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